Microsoft OmniParser V2.0 is a next-gen AI visual parsing tool that converts GUI to structured data, with faster speed, higher accuracy, and seamless LLM integration.
CogAgent-9B: A 9B-parameter GUI agent by Zhipu AI and Tsinghua University that excels in interface understanding and automation, outperforming other models in MM-Vet and more benchmarks